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Abstract. In this paper we study random induced subgraphs of the binary n-cube, Qj • This 
random graph is obtained by selecting each Q^-vertcx with independent probability A n . Using 
a novel construction of subcomponents we study the largest component for A n = , where 

e > Xn > n — 3 +l5 , 8 > 0. We prove that there exists a.s. a unique largest component C^K We 
furthermore show that Xn = e, \C^\ ~ a(e) 2 n and for o(l) = \ n > n~i +S , \C ( n r> \ ~ 

2x« 2™ holds. This improves the result of [B] where constant \n = X ' s considered. In 

particular, in case of A n = — — , our analysis implies that a.s. a unique giant component exists. 



1. Introduction 



1.1. Background. Burtin was the first [9] to study the connectedness of random subgraphs of 
n-cubes, Q% , obtained by selecting all -edges independently (with probability p n ). He proved 
that a.s. all such subgraphs are connected for p > 1/2 and are disconnected for p < 1/2. Erdos and 
Spencer [11] refined Burtin's result and, more importantly in our context, they conjectured that 
there exists a.s. a giant component for p n = and e > 0. Their conjecture was proved by Ajtai, 
Komlos and Szemeredi [2] who established the existence of a giant component for p n = i±i , Key 
ingredients in their proof are Harper's isoperimetric inequality [13j and a two round randomization, 
used for showing the non existence of certain splits. Several variations including the analysis of 
the giant component in random graphs with given average degree sequence have been studied 
[U [TBI IXZ] • Bollobas, Kohayakawa and Luczak [7] analyzed the behavior for e tending to and 
showed in particular that the constant for the giant component for fixed e > coincides with 
the probability of infinite survival of the associated Poisson branching process. Spencer et al. 
[5] refined their results, using specific properties of the n-cube as for instance the isoperimetric 
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inequality [13] and Ajtai et aZ.'s two round randomization idea. Considerably less is known for 
random induced subgraphs of the n-cube obtained by independently selecting each Q2 -vertex with 
probability A„. The main result here is the paper of Bollobas et. at who have shown in [6] for 
constant \ that Cn^ — (1 + °{^)) K X^r^ n ■ I n this paper we improve this result. We show that 
for Xn > n~^> +s , where S > a unique largest component exists and determine its size. The key 
observation is a novel construction for small subcomponents given in Lemma [2j 



Random induced subgraphs arise in the context of molecular folding maps [21 where the neutral 
networks of molecular structures can be modeled as random induced subgraphs of n-cubes |18j . 
They also occur in the context of neutral evolution of populations (i.e. families of Q2 _ver ti ces ) 
consisting of erroneously replicating bit strings. Here, we work of course in QJ, since we have the 
alphabet {A, U, G,C}. Random induced subgraphs of n-cubes have had impact on conceptual 
level [20 and led to experimental work identifying sequences that realize two distinct ribozymes 
|19j . A systematic computational analysis of neutral networks of molecular folding maps can be 
found in [12] . The main result of this paper is the following 



Theorem. Let Q% \ be the random graph consisting of Q -subgraphs, T n , induced by selecting 
each Q^-vertex with independent probability X n = , where e > \n > n~^ +s , e,5 > 0. Then 
we have 

(1.1) lim P( |Cl 1} I ~ a(e) 2" and C { n 1} is unique) = 1 
and for o(l) = \ n > n~^ +s 

(1.2) lim P( |C*W I ~ 2 Xn +Xn 2" and is unique) = 1. 
n— >oo n 

For Xn — ^ the above theorem (combined with a straightforward argument for X n < ^—^) implies 



(1.3) lim P(r„ has an unique giant component) = 



1 for A„ > i±£ 
for A„ < 



This is the random induced subgraph analogue of Ajtai et aUs 12! result. We present in Lemma[2] 
a novel construction of subcomponents using branching processes inductively. We prove the main 
result using a generic vertex-boundary result due to Aldous [3l 0] . All results proved in this paper 
remain valid for n-cubes over arbitrary, finite alphabets. 
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1.2. Notation and terminology. The binary ro-cube, Q 2 , is a combinatorial graph with vertex 
set ¥ 2 in which two vertices are adjacent if they differ in exactly one coordinate. Let d(v, v') be 
the number of coordinates by which v and v' differ. We set 

(1.4) ViC^jXn; B(A,j) = {v e F£ | 3 a e A; d(v, a) < 3} 

(1.5) 5(A,j) = {v e¥ 2 \3a £ A; d(v, a) = j} 

(1.6) VAC F£; d{A) = {v e F 2 l | 3 a e A; d(v, a) = 1} 

and call B(A,j) and d(A) the ball of radius j around A and the vertex boundary of A in Q 2 , 
respectively. If A = {a} we simply write B(a,j). Let A, B C F 2 l , we call A £-dense in B if 
B(v, £) D A for v £ B. Q 2 can be viewed as the Cayley graph Cay(F2 , {ej | i = 1, . . . , n}) 
where is the canonical base vector. We will view ¥ 2 as a F2-vectorspace and denote the linear 
hull over {v\, . . . , v^}, Vj 6 F2 by (ui, «2, . . . , Vh)- There exists a natural linear order < over Q 2 
given by 

(1.7) v<v' (d(v,0) < d(v',0)) V (d(v,0) = d{v',0) A v <i cx v') , 

where <i cx denotes the lexicographical order. Any notion of minimal element or smallest element 
in A C Q 2 is considered w.r.t. the linear order < of eq. (jl.7| . 



Each A C ¥ 2 induces a unique induced subgraph in Q 2 , denoted by Qsj[.A], m which ai,a 2 £ A 
are adjacent iff ai,a2 are adjacent in Q 2 . Let Q 2X be the random graph consisting of Q 2 - 
subgraphs, r„, induced by selecting each Q^-vertex with independent probability A„. That is, 
Q 2 l \ is the finite probability space ({Q 2 [A] | A C F 2 l },P), with the probability measure ¥(A) = 
Ak 41 (1 - A n ) 2 "-I A I. A property M is a subset of induced subgraphs of Q 2 closed under graph 
isomorphisms. The terminology "M holds a.s." is equivalent to linin^oo P(M) = 1. A component 
of r„ is a maximal connected induced r„-subgraph, C n . The largest r ra -component is denoted by 
Cn \ It is called a giant component if and only if 

(1.8) 3k >o, |cW|> K |r n |, 

and x n ~ y n is equivalent to (a) linin^oo x n /y n exists and (b) linin^oo x n /y n = 1. Let Z n = 
& be a sum of mutually independent indicator random variables (r.v.), £j having values in 
{0, 1}. Then we have, [10], for rj > and c„ = min{- ln(e''[l + ^} 

(1.9) Prob( I Z n - E[Z n ] I > rjM[Z n ] ) < le^^ . 

n is always assumed to be sufficiently large and e is a positive constant satisfying < e < -|. We 
use the notation B m (£,X n ) = (™)Afj (1 - \ n ) m ~ e and write g(n) = 0(f(n)) and g(n) = o(/(n)) 
for g(n)/ f(n) ~ > n as n — * 00 and g(n)/f(n) — > as n — > 00, respectively. 
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2. Preliminaries 



Let us briefly recall some basic facts about branching processes [T31 [H] ■ Suppose £ is a random 
variable and (Q ), i,t € N counts the number of offspring of the ith-individual at generation t — 1. 
We consider the family of r.v. (Zi) ie j>s : Zq = 1 and Z t = Ym=\ C f° r * > 1 an d interpret Z 4 as 
the number of individuals "alive" in generation t. We will be interested in the limit probability 
lim^oo Prob(Z f > 0), i.e. the probability of infinite survival. We have 

Theorem 1. Let u n — , A n = 1 ~ l w x " , m = n — [|u n nj and Prob(£ = t) = B m (i,X n ). Then 
for Xn = e fie r.w. £ becomes asymptotically Poisson, i.e. P(£ = ~ ^-4^ e"^ 1+e - ) and 

(2.1) < lim Prob(Z t > 0) = a(e) < 1 . 

t — >oo 

For o(l) = Xn > 5 > me Ziaue 

(2.2) lim Prob(Zi > 0) = (2 + o(l)) X „ . 



In view of Theorem [I] we introduce the notation 

(2.3) .(Xn) = [ a{e) Xn = E 

(2{l + o{l))xn for o(l) = X n > n~3+ s . 

We procceed by labeling the indices of a Q2~ ver t ex v = (^lj • • • > x n)- F° r this purpose set 

1 k 

(2.4) Vn = I — ; -u n n\, in — I it„n| , and z„ = fezA, + t„ . 

v i n i 2 k(k + l) J ' L 2fc + 1 J ' 

We write an Q^-vevteii v = (xi, . . . , x n ) as 

(2 51 fa; (1) x {2) x {2) x {k+1) x {k+1) x ^ x ) 

s v J s v ' s v s v / 

^coordinates ^coordinates ^coordinates ™ — 

ii- Lj"n"J coordinates 

For any 1 < s < r = 1, . . . , k we set to be the s + (r — l)z/ n th-unit vector, i.e. has 
exactly one 1 at its (s + (r — l)u n )th coordinate. Similarly let ei k+1 \ 1 < s < i n denote the 
(s + fci/„)th-unit vector. We use the standard notation for the z„ + 1 < t < n unit vectors, i.e. et 
is the vector where xt = 1 and arj = 0, otherwise. 

In our first lemma we use Theorem [T] in order to obtain information about small components in 

r„. 
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Lemma 1. Suppose X n = and e > \n > n 3+ s r where 5 > 0. Then each T n -vertex is 

contained in a T n - subcomponent of size [ju n n\ with probability at least ir(x n )- 

Proof. We consider a branching-process in the sub-cube Q^~ z " (eq. Ij2.5|> ). W.l.o.g. we initialize 
the process at v = (0, . . . , 0) and set Eo — {e„_ Zn +i, . . . , e„} and Lq°' = {(0, . . . , 0)}. We consider 
the n — L|w„nJ smallest neighbors of v. Starting with the smallest we select each of them with 
independent probability A„ = ■ Suppose v + ej is the first being selected. Then we set 

Ei = E \ {ej} and = Lq U {e^} and proceed inductively setting E s = E s _i \ {e w } and 
4 0) = Lf\u{e w } for each neighbor v + e w being selected. We are given the following alternative, 
either we have (a) \E S \ = n — ([ju n n\ — 1) or (b) \E S \ > n — (|_|ti„nj — 1) and all smallest 
n — |_§ u n n\ neighbors of (0, . . . , 0) are checked. In case of (a) [j u n n\ — 1 vertices have been 
connected and since 

n - z n - (L^ u n n\ -l)>n - ^u n n - ^u n n + 1 > n - [^u n n\ 

there are still >n — [ju n n\ neighbors available. Suppose \E S \ > n — ([ju n n\ — 1) and all smallest 
n — [f u n n\ neighbors of (0, ... ,0) were examined. Then we proceed by choosing the smallest 
element of \ {0}, v\ and set L^p = l\® \ {v{}. By construction, v* has at least n — L| u n n J 
neighbors of the form v* + e r e r G E s . We begin with the smallest of these and continue selecting 
with probability 1+ ^ n setting E s — E s -x \ {ej} and L^p = Lji\ U {v{ + e 3 } for each neighbor 
vl + ej being selected. We continue inductively setting = Ef~^ \ {v*} and stop in case of (a). 
By construction, this process yields an induced sub-tree of Q^ -2 " • Theorem Q] guarantees that we 
have a r n -subcomponent of size |_7U n nJ with probability at least 7r(% n ). □ 

We refer to the particular branching process used in Lemma[T]as 7-process. The 7-process produces 
a subcomponent of size [jMn^J , which we refer to as 7-(sc). 



3. Small subcomponents 

The 7-process employed in Lemma [T] did by construction not involve the first z n coordinates. 
In the following lemma we will use the first k v n of them in order to build inductively larger 
subcomponents (sc). 
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Lemma 2. Let k € N be arbitrary but fixed, X n = , v n — l 2 ktk+i) -I an ^ ^™ = 7r (Xn)^n(l — 
e^+x™)"™/ 4 ). TTien i/iere exists pk > suc/i t/iai eac/i r„-weriea: is iwi/i probability at least 

(3-1) Tfc(Xn) =■*(*„) (i_ e -^») 

contained in a I subcomponent of size at least (u n n)(p^, where Ck > 0. 

Lemma [2] gives rise to introduce the induced subgraph r^fc = Q^tAI where 

(3.2) A = | i! is contained in a r„-(sc) of size > Ck (u n n)tp^, Ck > 0} . 

In case of e > Xn > n~^ +s we have 1 — e _ 3 ( - 1+x "- )tl " > u n /4 and consequently ^ n > d (1 + 
o(l))x n u^ n > Co n for some c', Co > 0. Furthermore 

(3.3) L^««nJ > Cfc nin fe5 , c fc > . 

Accordingly, choosing k sufficiently large, each r„-vertex is contained in a (sc) of arbitrary poly- 
nomial size with probability at least 

(3.4) tt(x„) (l - e-P" nS ) , 0<6,0< Pk . 

Proof. Since all translations are Q2~ au t° m orphisms we can w.l.o.g. assume that v = (0, . . . , 0). We 
use the notation of eq. (|2.5p and recruit the n — z„-unit vectors et for a 7-process. The 7-process of 
Lemma [T] yields a 7-(sc), C(0), of size L-jU„nJ with probability > 7r(x„). We consider for 1 < i < k 
the sets of v n elements Bi = {e£ , . . . , e^i} and set H = (e Un +i, ■ ■ ■ , e n ). By construction we have 

(3.5) (B f U( (J B.j) ® H) = (Bi) 8 ( (J B;j)®H. 

1<J'<»-1 l<j"<i — 1 

In particular, for any 1 < s < j < ei 1 ^ — e~- S i? is equivalent to = . Since all vertices 
are selected independently and |C(0)| = [jWn^J > for fixed ei 1 ^ G i?i the probability of not selecting 
a vertex v' G + O(0) is given by 

(3.6) p({e( 1} +£ I e e c(0)} nr n = 0) = 1 1 - ^ e -(i+xn)i«« . 

We set ji n = (1 - e "( 1 +x n )i« n ) ; i. e . Mn = P ((ei 1 ' + (7(0)) flL^j and introduce the r.v. 



(3.7) Xj = |{e« G Bi | 3£ G C(0); e« + £ G r„} 
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Obviously, ¥,(X±) = [i n v n and using the large deviation result of eq. (|1.9p we can conclude that 

(3.8) 3p > 0; P \Xi < < e-P^ Vn . 

Suppose for there exists some £ € C(0) such that + £ 6 T„ (that is e$ is counted by 
Xi). We then select the smallest element of the set {e^ + £ | £ G C(0)}, say e« + £o and 
initiate a 7-process using the n — z n elements {e» B +i, . . . , e„} at + £o- The process yields 
a 7-(sc) of size [| w n"-J with probability at least n(Xn)- For any two elements , with 
ei 1 ^ + ^(ei 1 ^), + £(e^) S T n the respective sets are vertex disjoint since (B± U H) = (Bi) H. 
Let X\ be the random variable counting the number of these new, pairwise vertex disjoint sets of 
7-(sc) of size L^w„nJ . By construction each of them is connected to C(0). We immediately observe 
> 7r(x„)/i n ^„ and set ip n = ir(xn)t^n,Vn- Using the large deviation result in eq. (|1.9p we 

derive 

(3.9) 3pi>0; P (xi < ^tpA < e- p ^ . 

We proceed by proving that for each 1 < i < k there exists a sequence of r.v.s (Xt,Xs, ■ ■ ■ ,Xi) 
where Xi counts the number of pairwise disjoint sets of 7-(sc) added at step 1 < j < i such that: 

(a) all sets, , 1 < j < i, added until step i are pairwise vertex disjoint and are of size 

(b) all sets added until step i are connected to C(0) and 

(3.10) 3 Pl > 0; P (Xi < ^{fnfj < e- p ' Vn , where <p n = n{Xn)»nV n ■ 

We prove the assertion by induction on i. Indeed in our construction of X\ have already established 
the induction basis. In order to define Xi+i we use the set = {e^ +1 \ . . . , e^„ +1 ^}. For each 

Ca counted by Xi (i.e. the vertices that were connected in step i) we form the set ei* +1 ' ) + C a l \ 
By induction hypothesis two different Ca , Ci/ , counted by X{ are vertex disjoint and connected 
to C(0). Since {B i+1 ) 0(Ui<j<j Bj) ®H are disjoint we can conclude 

(s + s' V a ± a) => (ei l+1 ' + C« ) fl (e£ +1) + C« ) = 

and the probability that we have for fixed Ca : (e^ 1 + C« ) n T n — for some e^ +1 ^ G is 
exactly as in eq. (|3.6f) 

= M- +X " 1 ^ p -(l+Xn)J«« 



((ei' i+1 )+c«)nr n = 0) 
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As for the induction basis, fj, n = (1 - e -( 1 +Xr»)i« n ) [ s the probability that (e^ +1) + C^) C\T n ^ 0. 
We proceed by defining the r.v. 

(3.H) AVi = ]T|{4 m) £% |3£eC« ;e i* +1 >+£er„} . 

counts the number of events where (e s l+1 ' + Cq" 1 ) (~l r„ ^ for each Ca , respectively. For 
fixed Ci i} and fixed ei i+1) G B l+1 we choose the minimal element 

+ £o,a G + & | & G C « , + £ Q G T n } . 

Then X^+i counts exactly the minimal elements ei l+1 ^ + £o,cn e*/ + £o,a'j ■ ■ • f° r all C« , , . . . 
and any two can be used to construct pairwise vertex disjoint 7-(sc) of size [ju n n\. We next 
define Xi+± to be the r.v. counting the number of events that the 7-process in H initiated at the 
eg ^ + £o.a G r„ yields a 7-(sc) of size [ju n n\. By construction each of these is connected to a 
unique c£ . Since (B i+ i) 0(Ui<j<i Bj) ®H au newly added sets are pairwise vertex disjoint to 
all previously added vertices. We derive 

2 



X i+ i < ttP? 1 ) < P (Xi < ^C] + P (x i+l < A X > I^J 



failure at stop i failure at step t + 1 conditional to X t > -i<^* 

±e- p < + \l-e- piV "), p>0 
< e~ Pi + 1,pn . 

Therefore each T„-vertex is with probability at least ir(xn) (1 — e~ Pk<Pn ) contained in a r„-(sc) of 
size at least Cfe (Xnn)<fini f° r c& > and the lemma is proved. □ 



We next prove a technical lemma which will be instrumental for the proof of Lemma |H We show 
that the number of vertices not contained in T n k is sharply concentrated, using a strategy similar 
to that in Bollobas et.al. [7J. Let U n denote the complement of T n k in T n . 

Lemma 3. Let k G N and A„ = 1 ~ l n x " , where e > Xn > n ~i +s . Then we have 
(3.12) P (\ \U n \ - E[\U n \] | > -n\U n \}) = o(l) . 



Proof. Let X v be the indicator variable for the event v G U n , i.e. v is contained in a r„-(sc) of size 
< Cfe (u n n)(pn- Clearly \U n \ = Xuer X v - In order to prove our concentration result we proceed 
by estimating V[|Z7 n |]. Suppose v ^ v'. There are two ways by which X v ,X v i viewed as r.v. over 
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Q2 x can be correlated. First v,v' can belong to the same component in which case we write 
v ~i v'. Clearly, 

(3.13) J2 E i X v X *} < c fe E[\U n \] . 

Secondly, correlation arises when v, v' belong to two different components C v , C' v > having minimal 
distance 2 in Q^- in this case we write v ~2 v'. Then there exists some Q2 -vertex, w, such that 
{w} C d(C v ) n d(CV) and we derive 

¥(C V and C v , are r„-(sc), d{C v ,C v >) = 2) = 1 ~ A " ¥(C V U C v > U {w} is a r„-(sc)) 

< n P(C„ U C„/ U {w} is a r„-(sc)) . 

Since for v ^2 v' we have ¥(C V and CV are r„-(sc), d(C v , C v >) = 2) = E[X V X v >] we can immedi- 
ately give the upper bound 

(3.14) E ^ X v x «] < n (2c fe (u n n)(p h n + l) 3 |r n | . 
The uncorrelated pairs [X Vl X v >), writing v v\ can easily be estimated by 

(3.15) E i x v x v>] = E i x vMXv] < n\u n \} 2 . 

Since Y[\U n \] = E[\U n \(\U n \ - 1)] + E[|[/„|] - E[|[/„|] 2 we have 

E[|ig(|ig - 1)] = J2 E[X„ JT /] + ^ E[X V X V ,} + Y E \XvX v ,] 

< c k {u n n)<p k n E[\U n \] + n (2c k {u n n)ip k n + l) 3 \T n \ + E[\U n \} 2 . 
Considering isolated vertices in T n immediately implies E[|{7 n |] > c|r„| for some c > 0, whence 



V[|C/ n |]/E[|C/„|] 2 < a + ^M^ + n(2c k (u n nU + ir = ^ 

C 1 n \ 71 



Via Chebyshev's inequality P(\\U n \ - E[\U n \]\ > £E[|U n |]) < n 2 gn^ro holds, whence the lemma. 

□ 

Lemma 4. Let \ n — 1+ * n where e > \ n > n~i +s . Then we have for sufficiently large k E N 
(3.16) (l-o(l))7r(x„)|r n |<|r n , fe |<(l + o(l))7r(x„)|r n | a.8. 



In view of Lemma[2]the crucial part is to show that there are sufficiently many r„-vertices contained 
in r n -(sc) of size < c k u n n^. For this purpose we use a strategy introduced by Bollobas et.al. [7] 
and consider the n-regular rooted tree T n . Let v* denote the root of T n . Then v* has n descendents 



10 



CHRISTIAN M. REIDYS 



and all other T„-vertices have n — 1 Selecting the T n -vertices with independent probability A„ we 
obtain the probability space T n ^\ n whose elements, A n , are random induced subtrees. We will be 
interested in the A„-component which contains the root, denoted by C v * . Let and £„, for v =/= v* 
be two r.v. such that Prob(£„. = i) = B n (£, A„) and Prob(£„ = I) = B n -\(£, A„), respectively. We 
assume that £„* an d £u count the offspring produced at v* and Then the induced branching 

process initialized at v* , {Zi)i^i a constructs C v *. Let 7To(x) denote its survival probability, then 
we have in view of Theorem Q] and [7], Corollary 6: 

(3.17) 7To(Xn) = (l + °(l))7r( X „). 



Proof. Claim 1. |r„, fc | > ((1 - o(l)) 7r(x„)) |L„| a.s. 

According to Lemma [2] we have E[|{7 n |] < (1 — ^k{Xn)) |r„| and we can conclude using Lemma [3] 
andE[|[g] =0(|r„|) 

(3.18) \U n \ <(l + O(i)) E[\U n \] <(l- (n k ( X n) - °(^))J l r «l a - s - 



In view of eq. (|3.1|) and \ n >n 3 + we have for arbitrary but fixed k, 

7r fe (Xn)-0(i) = (l- (l))7r( Xn ) • 

Therefore we derive 

(3.19) |r nifc |>(l-o(l))7r(x„)|r n | a.s., 
and Claim 1 follows. 

Claim 2. For sufficiently large k, L„^ < ((1 + o(l)) n(xn)) T„ a.s. holds. 
For any fixed Q^-vertex, v, we have the inequality 

(3.20) P(|C„.| <£) <P(|G»| <£) . 

Indeed we can obtain C v by inductively constructing a spanning tree as follows: suppose the set of 
all CVvertices at distance h is M h " . Starting with the smallest w £ M^ 1 ' (h > 1) there are at most 
n — 1 u;-neighbors contained in that are not neighbors for some smaller w' £ M^ v . Hence 

for any w £ M h " at most n — 1 vertices have to be examined. The A re -component C v * is generated 
by the same procedure. Then for each w £ M h v there are exactly n — 1 neighbors in M h ^_ 1 . 
Since the process adds at each stage less or equally many vertices for C v we have by construction 
\C V \ < |CV;*|. Standard estimates for Binomial coefficients allow to estimate the numbers of T n - 
subtrees containg the root [7|, Corollary 3. Since vertex boundaries in T n are easily obtained we 
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can accordingly compute P(|C„«| = £)■ Choosing k sufBciently large the estimates in [7], Lemma 
22, guarantee 

(3.21) Pfla.1 <c k u n n<p k ) = (l-n (xn)) + o(e- n ) . 

In view of P (\C V , | < t) < P (\C V \ < I) and eq. (|3~T7j) we can conclude from eq. (j^2T|) 

(3.22) (1 - (1 + o(l))7r( X n)) |r„| + o(l) < E[\U n \] . 
According to Lemma[3]we have (1 — 0(i)) E[\U„ |] < \U n \ a.s. and therefore 

(3.23) (l-(l + o(l) + 0(i))7r(Xn))|r n |<|E/ B | a.s. 

n 

Eq. (|3~Tg)) and eq. (|3~2"31 imply 

(3.24) (1 - (1)) 7r(x„)|r n | < |r„, fc | < (1 + o(l)) 7r(x„)|r n | a.s., 

whence the lemma. □ 



Finally we show that T n ^ is a.s. 2-dense in Q% with the exception of 2™ e~ A " vertices. Accordingly 
T nt k is uniformly distributed in T n . The lemma will allow us to establish via Lenima[7]the existence 
of many vertex disjoint short paths between certain splits of the r^-vertices. 

Lemma 5. Let k G N and X n = 1+ * n and e > Xn > n _ 3 +l5 . TTien me have 

(3.25) 3A>0;V«GF», IP ^|S(^, 2) n r„, fc | < | ^-^-—^^ n s j < e" A ^ 

Let D s = {v\ |S( V ,2)nr„ !fc | < i (^y) 2 ^}, then 

(3.26) | Da | < 2" e~ AnS a.s., where A > A > . 

Proof. To prove the lemma we use the last (eq. (|2.5|1 ) t n = [ 2 ^ fc fc | _ 1 ^ it n nJ elements , . . . , el k+1 . 

We consider for arbitrary u <E 

(3.27) S (fc+1) (v, 2) = {v + ef +1) + ef +1) | 1 < i < j < i n , } . 

Clearly, \S (k+1} (v,2)\ = ('") holds. By construction the T„-(sc) of size > Cfe (u n n)y„ of Lemma [2] 
are vertex disjoint for any two vertices in S k+1 \v, 2) nr„ and each r„-vertex belongs to T nt k with 
probability > TTk(Xn)- Let Z be the r.v. counting the number of vertices in S^ k+1 \v,2) n T n ^- 

Then we have E[Z] ~ ( 2 (fc+i) ) if n7r (Xn)- Eq. (|3.25|) follows now from eq. (|1.9p . u^riXn > 
and P(|S(u,2) nT„^| < 77) < P(|S (fc+1) (w, 2) n r„ >fe | < 77). Let now L> 5 = {w | |S(u,2) n 
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r n ,fc| < \ ^ 2 (fc+i) j n& }- By linearity of expectation E(|Z?5|) < 2"e An " holds and using Markov's 
inequality P(X > tE{X)) < 1/t for t > 0, we derive that \D S \ < T l e~ Kn& a.s. for any < A < 
A. □ 



4. Vertex boundary and paths 



The following Proposition is due to [5] used for Sidon sets in groups in the context of Cayley 
graphs. In the following G denotes a finite group and M a finite set acted upon by G. 

Proposition 1. Suppose G act transitively on M and let A C M , then we have 

(4.1) J- J2\AngA\ = \A\ 2 /\M\ . 

1 1 gee 



Proof. We prove eq. (|4.ip by induction on \A\. For A = {x} we derive p E 9 i=i 1 = I^I/I^Ij 
since \M\ — \G\/\G X \. We next prove the induction step. We write A = A U {x} and compute 

-L\2\ An 9A\ - -^Y.(\ A o n 9A\ + \{gx}nA Q \ + \{x}ngA \ + \{gx}n{x}\ 
= ^(\Ao\ 2 \G x \ + 21^11^1 + 1^1) 

- >oi + imi)^. 

□ 



Aldous [Sid] observed how to use Proposition [T] for deriving a very general lower bound for vertex 
boundaries in Cayley graphs: 

Lemma 6. Suppose G acts transitively on M and let A C M , and let S be a generating set of the 
Cayley graph Cay(G, S) where \S\ = n. Then we have 

(4.2) 3s eS; \ a A\A\>-\A\(l-j§r). 

n \M\ 
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Proof. We compute 

(4-3) \A\ = p Y^(\gA \A\ + \An gA\) = E 1^ \ A l + l A l |^[ 

and hence |A|(1 — Jjj|t) = r^r ^2 g \gA \ A\. From this we can immediately conclude 

3g eG; |^ \ A\ > \A\{1 - . 
Let <? = Oj=i s j- Since each element of gA \ A is contained in at least one set sjA \ A we obtain 

k 

\gA\A\Kj2\sjA\A\ . 
j=i 

Hence there exists some 1 < j < k such that |sj-A \ A\ > j:\gA \ A\ and the lemma follows. □ 



The next lemma proves the existence of many vertex disjoint paths connecting the boundaries of 
certain splits of r^-vertices. The lemma is related to a result in [5] but much stronger since the 
actual length of these paths is < 3. The shortness of these paths results from the 2-density of 
Tn./c (Lemma [5| and is a consequence of our particular construction of small subcomponents in 
Lemma [5] 

Lemma 7. Suppose X n = 1+ * n where \ n = e or o(l) = Xn > n^^ +s Let (A, B) be a split of the 
Tn^-vertex set with the properties 

(4.4) 30<o- <ffi<l; 42 n < |A| =<7o|r n , fc | and \ 2 n < \B\ = oi|r„,fc| . 
Then there exists some t > such that a.s. d(A) is connected to d(B) in via at least 

vertex disjoint (independent) paths of length < 3. 



(4.5) -j2V 



Proof. We consider B(A,2) and distinguish the cases 

(4.6) |B(A2)| < |2" and \B(A, 2)| > | 2" . 

Suppose first \B(A, 2)| < | 2" holds. According to Lemma[6]and eq. (|4.4[) we have 

(4.7) 3d x >0; |d(B(A2))| > ^2" 
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and Lemma [5] guarantees that a.s. all except of at most 2"e" Q^-vertices are within distance 
2 to some T^-vertex. Hence there exist at least 2™ vertices of d(B(A, 2)) (which by definition 
are not contained in B(A, 2)) contained in B(B,2) i.e. 

(4.8) |dB(A,2) n B(S,2)| > 4 2 " a.s. 

For each j3 2 G d(B(A, 2)) n B(B. 2) there exists a path (c&i, «2, /?2), starting in d(A) with terminus 
/3 2 . In view of B(B, 2) = d(B(_B, 1))UB(B, 1) we distinguish the following cases 

(4.9) |d(B(A,2))nd(B(B,l))| >^d 2>1 2 n and |d(B(A, 2)) n B(B, 1)| > \ d %2 2™ . 

Suppose we have |d(B(A, 2)) n d(B(fl, 1))| > ^d 2 ,i 2". For each /3 2 G d(B(B, 1)) we select some 
element (3i(0 2 ) G d(£?) and set £?* C d(i?) to be the set of these endpoints. Clearly at most n 
elements in B(B, 2) can produce the same endpoint, whence 

Let B\ C B* be maximal subject to the condition that for any pair of -Bi-vertices ((3i,f3[) we have 
d(/3i,/3() > 6. Then we have |S X | > |-B*|/(") since \ B(v, 7)\ = ("). Any two of the paths from d(A) 
to i?i C d(i?) arc of the form (ai, a 2 , [3 2 , (3\) and vertex disjoint since each of them is contained in 
B(/?i,3). Therefore there are a.s. at least 

vertex disjoint paths connecting d(A) and d(B). Suppose next |d(B(A, 2)) n B(B, 1)[ > A c?2,2 2 r ' 
We conclude in complete analogy that there exist a.s. at least 



(4.10) -d 2 ,i 2"/ 



(4-11) \ d22 T/( n 

n A \ 5 



vertex disjoint paths of the form (ai, a 2 ,/3 2 ) connecting d(A) and d(B). It remains to consider 
the case \B(A, 2)| > |2". By construction both A and B satisfy eq. (|4.4p . respectively, whence it 
suffices to assume that also \B(B, 2)| > |2 n holds. In this case we have 



|B(A,2)nB(S,2)| > 12" 

and to each a 2 G B(A,2) n B(_B,2) we select ai G d(A) and /3j G d(B). We derive in analogy to 
the previous arguments that there exist a.s. at least 

(4.12) ^d 2 2 n /( U 

pairwise vertex disjoint paths of the form (ai, a 2 ,Pi) and the proof of the lemma is complete. □ 
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5. The largest component 



Theorem 2. Let Q% x be the random graph consisting of Q% -subgraphs, T n , induced by selecting 
each Q^-vertex with independent probability \ n . Suppose A„ = 1+Xn , where e > %„ > n~z +s , 
5 > 0. Then we have 

(5.1) lim P ( \Ci 1} \ ~ n(xn) 1 + Xn 2™ and Cft is unique) = 1 . 



Proof. Claim. We have \Cn \ ~ \^ n ,k\ a.s. 

To prove the Claim we use an idea introduced by Ajtai et.al. [2] and select Q2 -ver ti ces in two 
rounds. First we select Q2 ~ ver tices with independent probability 1+ *"^ 2 and subsequently with 
The probability for some vertex not to be chosen in both randomizations is (1— 1+ *"^ 2 )(1 — x "/ 2 ) = 
1 — 1+ n" t " 1+Xn ^ >Xn ^ 2 > 1 — 1+Xn ■ Hence selecting first with probability 1+ *"^ 2 (first round) and 
then with Xn ' 2 (second round) a vertex is selected with probability less than 1+x " (all preceding 
lemmas hold for the first randomization 1+x "^ 2 ) . We now select in our first round each Q 1 ^ -vertex 
with probability 1+ *"^ 2 ■ According to Lemma 0] 

(5.2) |r„, fe | ~7r(xn)|r»| a.s. 
Suppose T nt k contains a component, A, such that 

A 2 " < 1^1 < (i-6)|r«,*|, b>0 

then there exists a split of T n ^, (A, B) satisfying the assumptions of Lemma[7] (and d(A) fld(-B) = 
0). We now observe that Lemma [2] limits the number of ways these splits can be constructed. In 
view of 

(5.3) \^u n n\^>c k rfin k5 , c k > 

each A- vertex is contained in a component of size at least c k nin kS . Therefore there are at most 

(5.4) 2 (a»/(c*»3«")) 

ways to choose A in such a split. According to Lemma [7] there exists t > such that a.s. d(^4) is 
connected to d(B) in Q2 via at least ^ 2 n / (™) vertex disjoint paths of length < 3. We now select 
Q2 -vertices with probability None of the above > ^2"/(!) paths can be selected during 
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this process. Since any two paths are vertex disjoint the expected number of such splits is less 
than 

■ ■*•/(?) 



(5.5) 2 (2"/( C *««™ w )) L^X^j y ^2( 2 " /(cfcn8 " fc5) ) e 7 



2"/(?) 



Hence choosing k sufficiently large, we can conclude that a.s. there cannot exist such a split. 
Therefore \Cn\ ~ |r ra ,fc|, a.s. and the Claim is proved. According to Lemma[4]we therefore have 
\C'n ^\ ~ 7r(x«) In particular, for Xn = (■ Theorem[T](0 < a(e) < 1) implies that there exists a 
giant component. It remains to prove that C« is unique. By construction any large component, 
C' nl is necessarily contained in T n ^- In the proof of the Claim we have shown that a.s. there cannot 
exist a component C' n in T n with the property \C' n \ > |r„|. Therefore Cn is unique and the 
proof of the theorem is complete. □ 



Theorem [3] below is the analogue of Ajtai et.al.'s result [2] (for random subgraphs of n-cubes 
obtained by selecting Q^-edges independently). 

Theorem 3. Let Q% \ be the random graph consisting of Q% -subgraphs, T n , induced by selecting 
each Q^-vertex with independent probability X n . Then 



(5.6) lim P(r n has an unique giant component) 



1 for X„ > i±£ 
for X n < — 



Proof. We proved the first assertion in Theorem [2] It remains to consider the case A ra = . 
Claim. Suppose A„ = then there exists k' > such that |C^| < n' n holds. 
The expected number of components of size i is less than 

(5.7) Jr*"(i^)'-£r<i-.)' 

since there are 2 n ways to choose the first element and at most n-vertices to choose from subse- 
quently. This component is counted I times corresponding to all £ choices for the "first" vertex. 
Let X K > n be the r.v. counting the number of components of size > k' n. Choosing k' such that 
(1 - e) K ' < 1/4 we obtain 

(5.8) E(AV„)< £ -L 2 »(l- e )<< -l2"(l-er'"£(l- e )<< \ 
whence the Claim and the proof of the theorem is complete. □ 



l-(l-e) 
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